#LLM compression 共 3 个条目 论文 (3) Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score Demystifying When Pruning Works via Representation Hierarchies Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach